The REPRO server: finding protein internal sequence repeats through the Web.

نویسندگان

  • R A George
  • J Heringa
چکیده

European Molecular Biology Laboratory in Heidelberg, Germany, in 1992, the late Sir Karl Popper remarked that ‘science is like a black man searching a black room for a black cat that might not be there’. A researcher having a protein sequence at hand and wishing to find internal sequence repeats is faced with exactly this problem. Worse still, he or she might not even know what a cat is. If some help is available in that a motif is known, or there is a notion of amino acid conservation (i.e. the cat is known), then a myriad of software is available to use this information and delineate the reincarnations of the motif or pattern (for a list of available software, see http://www. expasy.ch/tools/#pattern). However, if no such patterns are available, the problem of recognizing and delineating repeats becomes distinctly more difficult (for a review, see Ref. 1). With the availability of complete genome sequences, it is becoming clear that sequence repeats are ubiquitous TIBS 25 – OCTOBER 2000

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Swelfe: a detector of internal repeats in sequences and structures

UNLABELLED Intragenic duplications of genetic material have important biological roles because of their protein sequence and structural consequences. We developed Swelfe to find internal repeats at three levels. Swelfe quickly identifies statistically significant internal repeats in DNA and amino acid sequences and in 3D structures using dynamic programming. The associated web server also shows...

متن کامل

FAIR: A server for internal sequence repeats

UNLABELLED An Internet computing server has been developed to identify all the occurrences of the internal sequence repeats in a protein and DNA sequences. Further, an option is provided for the users to check the occurrence(s) of the resultant sequence repeats in the other sequence and structure (Protein Data Bank) databases. The databases deployed in the proposed computing engine are up-to-da...

متن کامل

De novo identification of highly diverged protein repeats by probabilistic consistency

MOTIVATION An estimated 25% of all eukaryotic proteins contain repeats, which underlines the importance of duplication for evolving new protein functions. Internal repeats often correspond to structural or functional units in proteins. Methods capable of identifying diverged repeated segments or domains at the sequence level can therefore assist in predicting domain structures, inferring hypoth...

متن کامل

Investigation on Reliability Estimation of Loosely Coupled Software as a Service Execution Using Clustered and Non-Clustered Web Server

Evaluating the reliability of loosely coupled Software as a Service through the paradigm of a cluster-based and non-cluster-based web server is considered to be an important attribute for the service delivery and execution. We proposed a novel method for measuring the reliability of Software as a Service execution through load testing. The fault count of the model against the stresses of users ...

متن کامل

Finding Alu in primate genomes with AF‐1

UNLABELLED Repetitive sequences occupy more than 40% of the human genome which is much larger compared to the 2% occupied by the coding DNA. Amongst these Alu elements are the second largest class of repeats, occupying nearly 10% of the whole genome. Alus have been implicated in many genomic processes, sometimes giving rise to aberrations while many times playing as silent player in genomic and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Trends in biochemical sciences

دوره 25 10  شماره 

صفحات  -

تاریخ انتشار 2000